首页> 外文OA文献 >Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning
【2h】

Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning

机译:使用批量强化学习的恒温控制负荷的住宅需求响应

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Driven by recent advances in batch Reinforcement Learning (RL), this paper contributes to the application of batch RL to demand response. In contrast to conventional model-based approaches, batch RL techniques do not require a system identification step, making them more suitable for a large-scale implementation. This paper extends fitted Q-iteration, a standard batch RL technique, to the situation when a forecast of the exogenous data is provided. In general, batch RL techniques do not rely on expert knowledge about the system dynamics or the solution. However, if some expert knowledge is provided, it can be incorporated by using the proposed policy adjustment method. Finally, we tackle the challenge of finding an open-loop schedule required to participate in the day-ahead market. We propose a model-free Monte Carlo method that uses a metric based on the state-action value function or Q-function and we illustrate this method by finding the day-ahead schedule of a heat-pump thermostat. Our experiments show that batch RL techniques provide a valuable alternative to model-based controllers and that they can be used to construct both closed-loop and open-loop policies.
机译:在批量强化学习(RL)的最新进展的推动下,本文为将RL应用于需求响应做出了贡献。与传统的基于模型的方法相比,批处理RL技术不需要系统识别步骤,从而使其更适合大规模实施。本文将拟合的Q迭代(一种标准的批处理RL技术)扩展到提供外部数据预测的情况。通常,批处理RL技术不依赖于有关系统动力学或解决方案的专业知识。但是,如果提供了一些专家知识,则可以使用建议的策略调整方法将其合并。最后,我们解决了寻找参与日前市场所需的开环时间表的挑战。我们提出了一种无模型的蒙特卡洛方法,该方法使用基于状态作用值函数或Q函数的度量,并通过查找热泵恒温器的日前时间表来说明该方法。我们的实验表明,批量RL技术可为基于模型的控制器提供有价值的替代方法,并且它们可用于构造闭环和开环策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号